Matrix factorization over max-times algebra for data mining

نویسندگان

  • Sanjar Karaev
  • Pauli Miettinen
  • Gerhard Weikum
چکیده

Decomposing a given matrix into two factor matrices is a frequently used technique in data mining for uncovering underlying latent patterns in the data. Unlike in pure mathematics, the emphasis is put on obtaining results that are interpretable, rather than necessarily having a small reconstruction error. One approach to increase interpretability is to pose constraints on the factors. For example they might be restricted to the same type as the original matrix. Among many different ways one can define matrix multiplication, the standard and Boolean cases are the most thoroughly studied. In this work we introduce matrix multiplication over max-times algebra, which is the set of nonnegative real numbers endowed with standard multiplication, but with addition being replaced by the maximization operation. The main objective of the thesis is to develop efficient factorization algorithms for this newly defined matrix multiplication. We propose several methods for solving this problem. The choice of a particular algorithm depends on the nature of the data, in particular its density. The sparser the matrices, the closer their max-multiplication is to the standard matrix multiplication, and for extremely sparse data even an algorithm for nonnegative matrix factorization can be quite good. However, for other cases algorithms that are specifically tailored for max-times data are required. The max matrix factorization problem is hard, and solving the problem precisely for large input sizes is infeasible, which means that approximate methods should be sought. It turned out in the experiments that the most promising approach is to relax the objective in such way that it becomes differentiable, after which convex optimization algorithms (e.g. gradient descent) can be used. This produces relatively good results when the max-times structure is present in the data.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Capricorn: An Algorithm for Subtropical Matrix Factorization

Max-times algebra, sometimes known as subtropical algebra, is a semi-ring over the nonnegative real numbers where the addition operation is the max function and the multiplication is the standard one. Factorizing a nonnegative matrix over the maxtimes algebra, instead of the standard (nonnegative) one, allows us to find structures and regularities that cannot be easily expressed in the standard...

متن کامل

A Projected Alternating Least square Approach for Computation of Nonnegative Matrix Factorization

Nonnegative matrix factorization (NMF) is a common method in data mining that have been used in different applications as a dimension reduction, classification or clustering method. Methods in alternating least square (ALS) approach usually used to solve this non-convex minimization problem.  At each step of ALS algorithms two convex least square problems should be solved, which causes high com...

متن کامل

Algorithms for Approximate Subtropical Matrix Factorization

Matrix factorization methods are important tools in data mining and analysis. They can be used for many tasks, ranging from dimensionality reduction to visualization. In this paper we concentrate on the use of matrix factorizations for finding patterns from the data. Rather than using the standard algebra – and the summation of the rank-1 components to build the approximation of the original ma...

متن کامل

Image Compression Method Based on QR-Wavelet ‎Transformation

In this paper, a procedure is reported that discuss how linear algebra can be used in image compression. The basic idea is that each image can be represented as a matrix. We apply linear algebra (QR ‎factorization and wavelet ‎transformation ‎algorithm‏s) on this matrix and get a reduced matrix out such that the image corresponding to this reduced matrix requires much less storage space than th...

متن کامل

Minimal solution of fuzzy neutrosophic soft matrix

The aim of this article is to study the concept of unique solvability of max-min fuzzy neutrosophic soft matrix equation and strong regularity of fuzzy neutrosophic soft matrices over Fuzzy Neutrosophic Soft Algebra (FNSA). A Fuzzy Neutrosophic Soft Matrix (FNSM) is said to have Strong, Linear Independent (SLI) column (or, in the case of fuzzy neutrosophic soft square matrices, to be strongly r...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013